Corpus: swe_web_2002_100K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of distinct word N-grams (N = 1 to 5) at the ends of the first K sentences

      K  # of words  # of bigrams  # of trigrams  # of 4-grams  # of 5-grams
    100          99            99             99            99            99
   1000         917           988            998           999           999
  10000        7081          9486           9936          9987          9998
 100000       45157         82843          96378         99145         99795
1000000       45158         82844          96379         99146         99796
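
The counts in the table can be approximated with a short script. The following is only a minimal sketch and not the tool that produced this page: it assumes one sentence per string, whitespace tokenization, and that sentences shorter than N tokens contribute no N-gram. The function names ending_ngram and count_distinct_endings are hypothetical.

```python
from typing import Dict, Iterable, List, Optional, Tuple


def ending_ngram(tokens: List[str], n: int) -> Optional[Tuple[str, ...]]:
    """Return the last n tokens as a tuple, or None if the sentence is too short."""
    if len(tokens) < n:
        return None
    return tuple(tokens[-n:])


def count_distinct_endings(
    sentences: Iterable[str], ks: Iterable[int], max_n: int = 5
) -> Dict[int, List[int]]:
    """For each K in ks, count distinct ending N-grams (N = 1..max_n)
    among the first K sentences."""
    targets = set(ks)
    seen = {n: set() for n in range(1, max_n + 1)}
    result: Dict[int, List[int]] = {}

    for count, sentence in enumerate(sentences, start=1):
        tokens = sentence.split()  # assumption: whitespace tokenization
        for n in range(1, max_n + 1):
            gram = ending_ngram(tokens, n)
            if gram is not None:
                seen[n].add(gram)
        if count in targets:
            result[count] = [len(seen[n]) for n in range(1, max_n + 1)]

    # A K larger than the corpus simply reports the totals for all sentences.
    totals = [len(seen[n]) for n in range(1, max_n + 1)]
    for k in targets:
        result.setdefault(k, totals)
    return result


if __name__ == "__main__":
    demo = ["a b c", "x b c", "a b d"]
    for k, counts in sorted(count_distinct_endings(demo, [1, 2, 3, 10], max_n=3).items()):
        print(k, counts)
```

Each row corresponds to one K, each column to one N. Because the sets only grow as more sentences are read, the counts never decrease with K, and they stop changing once K exceeds the number of sentences in the corpus.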


Zipf's diagram for sentence endings
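
A Zipf diagram plots frequency against rank, usually on log-log axes. The data behind such a plot for sentence-final words could be derived roughly as follows. This is a sketch under assumptions (whitespace tokenization, the last token taken as the sentence ending); the function name zipf_rank_frequency is hypothetical and the page's own plotting procedure is not documented here.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def zipf_rank_frequency(sentences: Iterable[str]) -> List[Tuple[int, int]]:
    """Return (rank, frequency) pairs for sentence-final tokens,
    with rank 1 assigned to the most frequent ending."""
    endings = Counter()
    for sentence in sentences:
        tokens = sentence.split()  # assumption: whitespace tokenization
        if tokens:
            endings[tokens[-1]] += 1
    frequencies = sorted(endings.values(), reverse=True)
    return list(enumerate(frequencies, start=1))


if __name__ == "__main__":
    for rank, freq in zipf_rank_frequency(["a b c", "x b c", "a b d"]):
        print(rank, freq)
```

The printed two-column output can be fed to a plotting tool with both axes set to logarithmic scale.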



14664 msec needed at 2018-06-27 01:15